
    Learning to Fix Build Errors with Graph2Diff Neural Networks

    Professional software developers spend a significant amount of time fixing builds, but this has received little attention as a problem in automatic program repair. We present a new deep learning architecture, called Graph2Diff, for automatically localizing and fixing build errors. We represent source code, build configuration files, and compiler diagnostic messages as a graph, and then use a Graph Neural Network model to predict a diff. A diff specifies how to modify the code’s abstract syntax tree, represented in the neural network as a sequence of tokens and pointers to code locations. Our network is an instance of a more general abstraction which we call Graph2Tocopo, which is potentially useful in any development tool for predicting source code changes. We evaluate the model on a dataset of over 500k real build errors and their resolutions from professional developers. Compared to DeepDelta [23], our approach tackles the harder task of predicting a more precise diff, yet still achieves over double the accuracy.
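
    The diff representation described above mixes ordinary output tokens with pointers into the input. As a rough illustration of that idea (not the paper's actual encoding; every name below is hypothetical), a Tocopo-style output sequence could be modeled like this:

        from dataclasses import dataclass
        from typing import List, Union

        @dataclass
        class Token:
            """An output token drawn from a fixed vocabulary."""
            text: str

        @dataclass
        class Copy:
            """Copy the token at this position of the input sequence."""
            input_index: int

        @dataclass
        class Pointer:
            """Point at a location (e.g. an AST node) in the input graph."""
            node_id: int

        TocopoOp = Union[Token, Copy, Pointer]

        # A toy predicted diff: "at AST node 17, replace the identifier
        # with the token that appears at input position 42".
        predicted_diff: List[TocopoOp] = [
            Pointer(node_id=17),   # localization: where to edit
            Token("REPLACE"),      # edit kind, from the output vocabulary
            Copy(input_index=42),  # new content, copied from the input
        ]

    Emitting pointers and copies instead of raw text keeps the target sequence short and position-precise, which is what makes this prediction task stricter than plain token generation.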

    Methods for visual mining of genomic and proteomic data atlases

    Background: As the volume, complexity, and diversity of the information that scientists work with daily continues to rise, so too does the need for new analytic software. Such software must resolve the tension between supporting a high level of scientific reasoning and remaining intuitive and easy to use, without requiring specialist, and often arduous, training. Information visualization offers a solution to this problem, as it allows direct manipulation of and interaction with diverse and complex data. The challenge facing bioinformatics researchers is how to apply this knowledge to data sets that are continually growing in a rapidly changing field.

    Results: This paper discusses an approach to developing visual mining tools capable of supporting the mining of the massive data collections used in systems biology research, and lessons learned while providing tools for both local researchers and the wider community. Example tools were developed to enable the exploration and analysis of both proteomics and genomics atlases. These atlases are large repositories of raw and processed experiment data generated to support the identification of biomarkers through mass spectrometry (the PeptideAtlas) and the genomic characterization of cancer (The Cancer Genome Atlas). Specifically, the tools allow for the visual mining of thousands of mass spectrometry experiments, to assist in designing informed targeted protein assays, and the interactive analysis of hundreds of genomes, to explore variation across different cancer genomes and cancer types.

    Conclusions: Mining massive repositories of biological data requires new tools and techniques. Visual exploration of the large-scale atlas data sets lets researchers find new meaning in the data and make sense of it at scales from single samples to entire populations. Providing linked task specific views that allow a user to start from points of interest (from diseases to single genes) enables targeted exploration of thousands of spectra and genomes. As the composition of the atlases changes and our understanding of the biology increases, new tasks will continually arise, so it is important to make the data available in a suitable form in as short a time as possible. We have done this through common visualization workflows, into which we rapidly deploy visual tools; these visualizations follow common metaphors where possible to assist users in understanding the displayed data. Rapid development of tools and task specific views allows researchers to mine large-scale data almost as quickly as it is produced. Ultimately these visual tools enable new inferences, new analyses, and further refinement of the large-scale data provided in atlases such as PeptideAtlas and The Cancer Genome Atlas.
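
    The "linked task specific views" mentioned in the conclusions amount to a selection broadcast: choosing a point of interest in one view re-filters the others. A minimal sketch of that pattern (illustrative only, not the paper's implementation; all names are invented):

        class View:
            def __init__(self, name):
                self.name = name

            def update(self, selection):
                # A real view would re-query the atlas and redraw;
                # here we just report the new focus.
                print(f"{self.name}: now showing data for {selection!r}")

        class LinkedViews:
            """Broadcast a selection (e.g. a gene or disease) to all views."""

            def __init__(self, views):
                self.views = views

            def select(self, selection):
                for view in self.views:
                    view.update(selection)

        # Start from a single gene and let every linked view follow.
        atlas = LinkedViews([View("spectra browser"), View("genome heatmap")])
        atlas.select("TP53")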

    Safety Management Information Statistics (SAMIS). 1993 annual report.

    Federal Transit Administration, Washington, D.C. Mode of access: Internet. Author corporate affiliation: Unisys Corporation, Cambridge, Mass. Report covers the period Jan - Dec 1993.

    GC assertions


    What can the GC compute efficiently? A language for heap assertions at GC time

    We present the DEAL language for heap assertions that are efficiently evaluated during garbage collection. DEAL is a rich, declarative, logic-based language whose programs are guaranteed to be executable with good whole-heap locality, i.e., within a single traversal over every live object on the heap and a finite neighborhood around each object. As a result, evaluating DEAL programs incurs negligible cost: for simple assertion checking at each garbage collection, the end-to-end execution slowdown is below 2%. DEAL is integrated into Java as a VM extension, and we demonstrate its efficiency and expressiveness with several applications and properties from the past literature. Compared to past systems for heap assertions, DEAL is distinguished by its very attractive expressiveness/efficiency tradeoff: it offers a significantly richer class of assertions than what past systems could check with a single traversal. Conversely, past systems that can express the same (or more complex) assertions as DEAL do so only at orders-of-magnitude higher cost.
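
    The key constraint above is that an assertion must be checkable in a single pass over the live heap, inspecting only a bounded neighborhood of each object. DEAL itself is a declarative logic language inside a Java VM; the Python sketch below is only an analogy for that evaluation strategy, and every name in it is hypothetical:

        def check_heap_assertion(roots, neighbors, violates):
            """Evaluate a local assertion in one marking-style traversal,
            visiting each live object once."""
            seen = set()
            stack = list(roots)
            while stack:
                obj = stack.pop()
                if id(obj) in seen:
                    continue
                seen.add(id(obj))
                for child in neighbors(obj):
                    # The predicate may look only at one object and one
                    # neighbor, mirroring the bounded-neighborhood rule.
                    if violates(obj, child):
                        return False
                    stack.append(child)
            return True

        # Toy heap of dicts; assert "no live object points to a closed resource".
        closed = {"closed": True}
        root = {"res": closed}
        ok = check_heap_assertion(
            roots=[root],
            neighbors=lambda o: [v for v in o.values() if isinstance(v, dict)],
            violates=lambda o, child: child.get("closed", False),
        )
        print(ok)  # False: the assertion is violated

    Because the predicate touches only one edge at a time, the whole check can piggyback on a garbage-collection traversal, which is why the reported slowdown stays negligible.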